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A  BAYESIAN  APPROACH  TO 
SOME  BEST  POPULATION  PROBLEMS 

Irwin  Guttman  and  George  C.  Tiao 
I .  Introduction  and  Summary 

There  have  been  several  papers  recently  in  the  literature  devoted 
to  the  subject  of  selecting  a  "best"  population  —  see  for  example  Gupta 
and  Sobel  (1962),  Guttman  (1961)  and  others.  In  these  works,  the  problem 
was  analyzed  from  the  sampling  theory  point  of  view.  Except  for  some 
simple  cases,  this  approach  frequently  leads  to  the  problem  of  eliminating 
nuisance  parameters.  And,  unless  rather  strong  assumptions  about  certain 
of  the  parameters  involved  are  made,  the  problem  usually  becomes  intractable. 

In  this  paper,  we  consider  certain  best  population  problems  adopting 
a  Bayesian  approach.  One  main  advantage  of  such  an  approach  is  that  we 
are  able  to  arrive  at  satisfactory  decision  procedures  in  the  presence  of 
nuisance  parameters.  Indeed,  the  use  of  Bayes'  theorem  allows  one  to 
analyse  best  population  problems  from  many  points  of  view.  The  plan  of 
this  paper  is  as  follows: 

In  section  2,  a  general  description  of  best  population  problems 
is  first  outlined.  The  criterion  for  "bestness"  is  regarded  as  a  "utility" 
function  of  the  statistician.  The  decision  procedure  then  adopted  is 
based  upon  the  principle  of  maximizing  posterior  expected  utility.  In 
sections  3  and  4,  we  discuss  the  application  of  this  procedure  when  sampling 
from  normal  populations  and  exponential  populations,  respectively.  In  both 
cases,  the  criterion  defining  the  best  population  is  taken  to  be  the 
coverage  of  the  population  considered  in  a  certain  given  interval.  The 
procedures  are  shown  in  section  5  to  be  consistent.  Further,  in  section  6, 
we  extend  the  decision  analysis  by  considering  the  posterior  distribution 
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of  the  criterion.  This  extension  enables  us  to  propose  other  decision 
procedures  which  may  be  more  appropriate  in  certain  situations  than  that 
resulting  from  the  principle  of  maximizing  posterior  expected  utility.  Some 
examples  are  illustrated  in  detail.  Finally,  in  section  7,  we  discuss 
briefly  some  different  types  of  best  population  problems. 

II.  Statement  of  the  Problem 

We  are  given  a  collection  n  of  k  populations  n  ,  n^,...,  H. ,. . . , 
in  which  there  exists  a  so-called  "best"  population,  where  the  population 
is  "best"  according  to  a  specific  definition  of  the  following  kind. 

Suppose  the  Jl^'s  are  distributed  with  probability  density  function 
f(x|ei),  where  is  possibly  vector-valued.  Consider  a  real  valued  function 
of  0£,  tu  s  g(©^),  where  g  is  known.  Then  the  "best"  population  is  defined 
to  be  that  population  which  has  largest  value  among  the  h^  =  g(0^) , 
i  »  1,  ...,  k.  We  assume  there  is  an  ordering  of  the  h^,  . ..,  hfc  into 

h  <  h  ...  <  b  <  h 

Cl]  [23  Ck-l]  [k] 

As  we  are  interested  in  determining  which  of  the  k  populations  has 

as  its  value  of  h,  the  value  h  ,  and  in  accordance  with  the  Bayesian 

L*'  J 

approach,  we  may  therefore  regard  the  function  h^  =  gC©^)  as  the  utility 
function  of  the  statistician  re  the  population  Jl^.  The  statistician's 
procedure  will  be  as  follows: 

From  each  of  the  TI^  select  a  random  sample  ^  of  size  n^.  Determine 
the  a  posteriori  distribution  p(QjJ^)  of  0^  given  the  sample  and  hence 
find  E(hjJ^),  the  expectation  of  h^,  i  =  1,  . ..,  k.  The  statistician  will 
then  choose  as  the  "best"  population  that  population  which  assumes  the 
maximum  value  of  these  expectations.  That  is,  the  one  with  value 
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max  E(h^|^)  e  max  /  hi  P<©iIZi.)  d6i  • 

In  sections  3  and  4,  we  apply  this  approach  to  two  particular 

a  2 

cases.  We  define  h^  *  g(6^)  *#/  f(x|0^)  dx,  where  and  a2  are  known, 
and  consider  the  cases: 

(a)  f  is  the  normal  density,  and 

(b)  f  is  the  exponential  density. 

This  problem  with  h^  defined  as  above,  has  been  considered  by  Guttman 
(1961)  from  the  non-Bayesian  point  of  view.  We  note  that  several  of  the 
more  interesting  cases  which  lead  to  considerable  mathematical  difficulties 
when  the  non-Bayesian  approach  is  used  are  readily  solvable  adopting  the 
above-mentioned  procedure  (see  below). 


The  Case  of  the  Normal  Pena 


When  the  populations  are  normally  distributed,  we  have  that 
*2 

\  ■  *(“i-  v  ■  / exp  r  ^  w2 } dy- 

ai  -  1 

Let  a  sample  ^  M  (yn . yin  )  be  taken  from  1^.  We  have  for  the 

likelihood  function: 

*<nt.  exp  j-  ^  il\  <ytj  -  ^>2 } 

(3.1) 

-  o'* *■  exp  {-  tni  *\  +  ni  (*i  *  ^i)2]  } 

2  2 
where  and  s‘  are  the  maximum  likelihood  estimators  of  and  dj, 


respectively. 
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Assume  that  we  are  in  a  situation  where  little  is  known  a  priori 
re  the  values  of  (4^,  o^),  for  all  i.  In  other  words,  we  are  saying  that 
the  information  we  have  re  (4^,  o  )  comes  primarily  from  the  sample 
We  may  then  adopt  the  approach  used  by  Jeffreys  (1961),  Savage  (1961), 
and  Box  and  Tiao  (1962),  and  assume  that  4^  and  log  0^  are  independent 
and  locally  uniformly  distributed  a  priori.  That  is. 


(3.2) 

all  i. 

(3.3) 
where 

(3. 
and 


pGij)  « 

P (log  crt)  «  k2  or  pfaj) 


Using  (3.2),  the  joint  posterior  distribution  of  (4^,  a.)  is: 

P^i*  tfJ*i>  =Pl(ail8i>  P2(Pil°i'  V 


.4.)  P^l.p  -  2  {r  <^>}  ««p  {-  rj1 } 


2  V1 


(3.4b)  p2(^i|ai»  yt)  exP  {  ‘  Pp2}  * 

We  note  from  (3.3)  that  the  adoption  of  the  prior  distributions  (3.2) 
amounts  to  saying  that  the  Joint  posterior  distribution  of  4^  and  log 
is  approximated  by  the  likelihood  function  (3.1). 

Consider  first  the  case  where  a^>-«.  Then,  the  a  posteriori 
expectation  of  h^  is 

00  «0 

Bog*!)  «/  /\  p^tK)  Pz^t^i'  °t>  doi  d^t 

-»  o 

00  00  a2 


JJJ  °*i(ni+2>  “K’  t1  t0ii'Z)2  ^i^i^i'^23} 


dz  do^  d4^. 
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It  is  clear  that  we  may  reverse  the  order  of  integration.  On  integrating 


out  and  ^  we  obtain 


.  ,  ,  '2  f2f  61"*)*  )  ‘  ~2 

<3'5)  i  *• 

That  is,  we  have  the  following  remarkably  simple  result 

(3.6)  BfrJj,)  -  Fvl(tl) 

where  F  is  the  cumulative  distribution  function  of  the  Student-t  variable 

V1  4  V*i 

with  (n  -1)  degrees  of  freedom  and  t,  =  (~j)  ( — - — ). 

1  1  ni 

Similarly,  if  a^  is  finite,  we  find  that 

(3.7)  KhJ*,)  -  fv,  tt<2)J  -  fvl(=!1)l, 

where  F  is,  as  before,  the  Student-t  cumulative  distribution  with  (n^-1) 
degrees  of  freedom  and 

(3  8)  ta) .  t(2)=  (vi^ 

(3.8)  tJL  -  ^  +1>  (  ),  ti  =  in^+1)  (  §1  )  • 

The  statistician's  procedure  is  now  clear.  If  one  behaves  as 
a  maximizer  of  his  expected  utility  --  see  for  example  Luce  and  Raiffa 
(1957),  and  Raiffa  and  Schlaifer  (1961),  he  computes  the  expectations 
(3.6)  or  (3.7)  depending  on  whether  a^  *  -«■  or  a^  is  finite,  respectively, 
for  each  population.  He  then  chooses  as  the  best  population  the  one  with 
the  sample  giving  the  maximum  value  of  these  expectations. 

In  the  case  where  a^  =  -«  (this  will  be  called  the  one-sided  case), 
we  note  that  if  n.  «=  n,  all  i,  then  this  selection  procedure  is  equivalent 

V*i 

to  choosing  the  population  which  has  as  value  of  ,  the  value 

"i 
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.  This  is  an  intuitively  pleasing  result  for  the  following 

i  J 

reason.  We  are  interested  in  the  best  population,  that  is,  the  population 


which  has  largest  value  of 
h 


1  ”  °t>  "  exP  {-  77  <X-‘V2}  dx 

-o»  i  _  20,  ' 


■j  jk  exp  {-  *2} dt' 


and  since  this  is  a  monotone  increasing  function  of  the  upper  limit,  say 

V^i 

t.  *  1 ,  the  best  population  is  that  with  largest  value  of  t,.  It  is 

i  a  iy 

intuitively  evident  that  an  estimate  of  t  should  be  based  on  . ~ . .  and 

V*i  81 

hence,  that  the  largest  value  of  — —  should  be  indicative  of  the  best 


population. 


IV.  The  Case  of  the  Exponential  Density 


In  this  section  we  discuss  the  situation  in  which  the  populations 
II  have  exponential  distributions.  That  is,  their  probability  density 


functions  fCyj&p  take  the  form: 


(4.1)  f(y|nt,  c^)  «  |  °i 

V  0 


y  > 

otherwise . 


We  take  up  the  one-sided  case  first,  namely,  we  let  =  a,  and  a^  = 

Then,  h^  becomes 

a 

(4.2)  h£  -  gO^,  at)  mJ  vt)  dy. 

Let  a  sample  ^  of  size  n^  be  taken  from  n^,  i  =  1,  ...,  k,  and  the 
observations  in  ^  be  ordered,  that  is  we  let  ^  =  (y^,  •••»  yln  )  where 
here  y^  <  y^  <  •••  <yin  •  Th®  likelihood  function  is  given  by: 
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(4.3)  10v  aj^)  -  a‘tni  exp  j-  ^  (y^  -  Ht)} 

" ffIni exp f  7[ [ <nil)  wt +  } 

where  Wi  -  (n^l)'1  (y^-y^- 

It  is  to  be  noted  that  for  all  o^  >  0  the  likelihood  function  (4.3) 
is  a  monotonic  increasing  function  of  in  the  interval  (-•,  y^)  and 
vanishes  outside  this  interval.  Two  interesting  cases  may  new  arise, 
namely,  (1)  yu  <  a,  and  (2)  yn  >  a. 

For  a  given  population  suppose  case  (1)  occurs.  This  implies 
that  <  a  and  hence  (4.2)  becomes 

(4.4)  h£  «  1  -  exp  . 

As  in  section  3,  we  assume  that  the  prior  distributions  for  and 
log  are  locally  uniform.  The  joint  posterior  distribution  of  and 
then  takes  the  form: 


(4.5) 

where 

(4.6a) 

and 

(4.6b) 

Hence , 


p(nt.  -  p3(°i|wi)  p^uJv  yn> 


p3(oi'wP 


r(nt-l) 


-nt  f  <Vl)wil_  „  . 

t  1  exp  4-  -  K  ot  >  0 

v*  O .  J 


ni  f  ni  1 

v  ytl)-  -±  exp  {■  (yii^i>|  • 


»*i  <  yii- 


the  expected  utility  is: 


E^ilii)  “//> i  P3(o1|wl)  P4(ni|a1,  ill)dol  d4i 


-  1 


n^+1 


a-y 


il 


(ni-l)wi 


--(n^l) 


(4.7) 
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We  again  note  that  this  result  (4.7)  is  intuitively  pleasing. 


To  begin  with,  we  are  interested  in  the  population  with  largest  value  of 


h^  as  given  in  (4.4),  which  is  a  monotone  increasing  function  of 


Hence,  the  population  with  largest  value  of 


a-(i. 


is  the  best  population. 


It  seems  natural,  therefore,  that  a  selection  procedure  fcr  the  best 

a_yil 

population  should  be  based  on  — —  .  In  the  special  case  n  *  n,  this 

wi 

leads  us  to  choose  that  population  which  yields  the  sample  with  largest 


value  of 


a-y 


il 


w, 


.  The  above  reasoning  is  borne  out  in  expression  (4.7), 


since  it  is  a  monotonic  increasing  function  of 


a-y 


il 


w. 


Suppose  now  that  for  a  population  case  (2)  arises;  then  there 
is  no  information  from  the  sample  to  tell  us  which  of  the  situations 
(i)  a  <  p^  or  (ii)  ^  <  a  obtains.  If,  in  fact,  our  prior  information 
includes  the  knowledge  that  a  <  p^,  then  the  utility  for  that  population 
is  zero  and  this  population  should  be  dropped  from  further  consideration. 

On  the  other  hand,  suppose  that  a  priori  little  is  known  about  p^, 
except  that  it  cannot  exceed  "a",  and  hence  the  utility  h^  is  again  given 
by  (4.4).  As  before,  one  can  approximate  the  posterior  distribution  of  p^ 
and  log  by  the  likelihood  function,  but  with  the  extra  restriction 
p^  <  a.  We  then  have  for  the  joint  posterior  distribution  of  p^  and  o^, 

(4.8)  p(pt,  ajj^)  3  PsCuJ0!'  *>  P6(° Jyi’  a> 


where 
(4 
and 


ni  f  “i  1 

•  9a)  PjtnJv  a>  -  3“  exp  I  -  —  (a-p^f  » 


<  a, 


(4.9b)  p6(o 
with  y 


U^-a)]*"!'1*  r  nt(yt-a) 

ilyi’  a)  "  '  r(nt-l)  °i  1  exp  \  a±  J 


_i  21  y 

nt  yij* 


i 
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Using  (4.8)  we  find  chat  the  expected  utility  is  simply 


(A.  10) 


V1 


We  note  that  the  expected  utility  in  (4.10)  approaches  zero  as  n^ 
tends  to  infinity.  This  is  as  it  should  be  for  here  <  *  <  y^i  and 
therefore,  as  n^  -*  «,  y^  converges  (in  probability)  to  ^  with  the  result 
that  the  utility  (4.4)  must  approach  zero.  This  has  an  interesting 
implication;  when  a  <  y  and  for  a  large  sample,  we  have  that  the  posterior 
expectation  for  our  utility  is  either  approximately  (if  <  a)  or 
exactly  (if  a  <  p  )  zero.  Thus,  for  a  large  sample,  irrespective  of  whether 
or  not  we  know  that  is  greater  or  less  than  "a",  the  population  should 
be  dropped  from  further  consideration. 

Further,  suppose  the  sample  sizes  are  small  and  that  the  inter¬ 
mediate  situation  occurs,  that  is,  yJ]L  <  a  for  some  of  the  and  a  <  y^ 

for  the  remaining  ones.  In  the  event  that  n^  ■  n,  all  1,  the  expression 
in  (4.10)  is  always  less  than  that  in  (4.7).  The  statistician's  procedure 
is  then  clear  --  he  will  ismediately  drop  from  further  consideration  those 
populations  with  a  <  y^  and  select  the  best  population  from  the  remaining 
ones. 

We  now  consider  the  "two-sided"  case  for  the  exponential,  that  is, 
we  will  be  interested  in  the  population  having  largest  value  among  the 
quantities: 

*2 

(4.11)  ht  a  g(^,  at)  mj  exp  /-  (x-1^)  j-  dx 

.  1  '  i 

al 

where  a^  and  a^  are  known  constants. 

For  a  given  population  we  shall  only  be  concerned  with  the 


following  two  cases: 


10. 


(4.12a) 

Hi  <  yu  <  • 

(4.12b) 

<  *i  <  y 

<  a 


11 


<  a 


2* 

* 

2‘ 


and 


Paralleling  Che  above  development,  we  find  that  the  expected 
utilities  are 


(4.13)  ICbJ,,)  -  [  {» 


vyn  i  -(y1) 

(ni-l)wi [ 


jiiiu  ’(vl> 


under  (4.12a),  and 


(4.14)  8(hiUi) 


ni+l 


1 


a2  -  »n  -(nt-D 

ni<yi-ai>J 


when  (4.12b)  holds. 

For  the  two-sided  case,  the  procedure  will  be:  Upon  taking  a 
sample  of  size  n^  from  each  of  the  11^,  compute  the  expectation  in  (4.13) 
or  (4.14)  according  to  whether  condition  (4.12a)  or  (4.12b)  obtains.  Then 
choose  as  the  best  population  the  one  which  has  the  largest  expectation. 


V.  Consistency  Properties  of  the  Procedure 

We  show  in  this  section  that  the  procedures  developed  in 
sections  3  and  4  have  a  very  interesting  property,  namely  that  of  con¬ 
sistency. 

Consider  first  sampling  from  normal  populations (section  3).  For 
the  two-sided  case,  the  a  posteriori  expectation  of  the  utility  h^  is 
given  by  (3.7).  This  is  the  area  between  t^  and  t^  under  a  3tudent-t 
distribution  with  (n^-1)  degrees  of  freedom.  As  n^  tends  to  infinity,  we 
see  from  (3.7)  and  (3.8)  that  this  expectation  approaches  the  value 

*As  in  the  one-sided  case,  we  are  assuming  here  that  we  have  a 
priori  information  that  is  less  than  a^. 


•2-y 


11. 


<5.0  */_  jfe 


-it2 


dt 


1  aryi 

•i 

where,  of  course,  y^  and  s^  are  the  observed  values  of  the  mean  and 
standard  deviation  of  an  infinitely  large  sample. 

From  the  posterior  distribution  p(p^,  cJjJ^)  in  (3.3),  it  is 


straightforward  to  verify  that  the  second  moment  of  h^  is: 

t<2)  t<2)  ,  2  n  +1 

(5.2)  E(h2|*  ) - f1  J1  [l+  ■1-72'27^1]  2  dy  dy 

1  1  2x(l-p2)*  ~(l)  £<1)  L  (n^lHi-pJ)  J  1 


with  pi  "  n^+T 


The  expression  in  (5.2)  is  a  bivariate  t-integral  --  see  Dunnett 
and  Sobel  (1954).  As  n^  -*  «,  p^  will  tend  to  zero  and  the  above  integral 
approaches  the  following  limit: 


A  1 

(5.3)  U»  _  EChJhj)  f  .xp  {-  i  (t2*.2)} 


dt  du 


which  is  of  course  the  square  of  (5.1).  That  is  to  say,  in  the  limit  the 
variance  of  h^  approaches  zero.  Thus,  we  have  the  important  result  that 
the  utility  h^  converges  (in  probability)  to  the  value  (5.1).  Since  the 
best  population  is  the  one  with  the  largest  value  of  the  quantities  in  (5.1), 
i  ■  1,  ...,  k,  our  procedures  described  in  section  3  insures  that  for  large 
samples  the-  best  population  will  always  be  chosen.  It  will  be  seen  that 
this  large  sample  property  also  holds  for  the  one-sided  case,  l.e.  with 
a^  ■  «. 

Turning  to  sampling  from  the  exponential  (section  4),  we  now 
demonstrate  the  consistency  property  for  the  one-sided  problem  when  y^  is  less 


12. 


than  "a".  Here,  the  utility  is  given  in  (4.4).  It  Is  straightforward 
to  verify  that  the  rC^  moment  of  h^  is 


(5.4) 


E(hjlii)  = 


ilo 


(-i  r 


(nt\  r  *<»-y n>\  -(Vl) 

\ni+V  t1  (n1-l)wiJ 


where  we  recall  that 


w  s  ■  yi  (v 
Wi  ni-l  j=2  ^yij 

we  have  that: 


y^  is  the  first  order  statistic  of  the  sample  ^  and 
y^).  In  particular,  by  setting  r  =  1  and  r  =  2, 


(5.5a)  EOiJjq) 
and 


1  - 


ni+1 


a-y 


1  + 


il 


(ni-l)wt  j 


l'^'D 


n.  r 

,  2(a’yii)r(vl> 

/-i-Y 

rl+  J2u_T 

at  1 

v2  L 

(n1-l)w1 

vni+l/ 

(nt-l)w^ 

The  expression  (5.5a)  is  of  course  the  expected  utility  as  given  in  (4.7). 
Now,  as  n^  -*  »,  the  expressions  (5.5a)  and  (5.5b)  tend  to,  respectively, 


(5.6a)  lim  EOiJ.^) 
n^-»  « 

and 


1  - 


a-y 


il 


w. 


} 


(5.6b)  lim  Var  (h  jjq)  =*  0. 
n^  -* «  1 

In  a  similar  way  one  can  show  for  the  other  cases  discussed  in  section  4 
that  the  utility  functions  involved  are  such  that  they  converge  in 
probability  to  finite  limits. 


VI .  Distribution  Theory  of  the  Utilities  «  r(Q^) 

Since  the  utility  h^  is  a  function  of  6^,  then  once  we  have  the 
posterior  distribution  of  0^,  we  can,  in  principle  at  least,  determine 
the  posterior  distribution  of  h^.  It  is  important  to  study  this  distribution, 
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since  1C  provides  ell  the  relevant  information  about  h^.  In  thle  section, 
the  posterior  distribution  of  is  illustrated  for  the  one-sided  cases 
of  sampling  from  both  normal  and  exponential  populations. 

For  the  normal,  the  joint  posterior  distribution  of  and  o^ 

Is  given  by  (3.3).  We  now  make  the  transformation 

im 

hi  "  S(|V  ffi)  mJ  Jto  exp  f  *2}  dt  “  *  (^) 

(6.1) 


where  $  is  t'ne  cumulative  distribution  function  of  the  standard  normal 
variable . 

The  absolute  value  of  the  Jacobian  J  of  the  transformation  (6.1) 
is  easily  seen  to  be  given  by 


,  . _  r(e?H,)2  1 

(6.2)  |j|  =  v2x  exp  | — j  . 

Using  (6.1)  and  (6.2),  and  upon  integrating  over  the  range  of  v^  we 
find  that  the  (marginal)  posterior  distribution  of  h^  is: 

(6.3)  Pttj^)  -  k4  exp  |-  -j  [  -13  j  J  t  exp  |-  - -^)  |dt 


where 


n.-3 


n.-l 


.  -1  ,2  r  .V1.  ‘2  ,  2  "  2 

k4  “  2  r  <  2^  ni  (si> 


m^,  »  (ht)  and  bi  «  -  a2» 


Since  m^  is  an  inverse  function  of  a  cumulative  normal,  it  is 
not  possible  to  express  the  density  in  (6.3)  in  a  closed  form.  Nevertheless, 
it  is  easy  to  show  that  this  density  can  be  put  into  the  alternative  form: 
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2  2 

r  m.  n.B.  1 

(6.4)  pOiJj^)  =  k4v/2it  exp  j-  2^22  * 

v3  ‘1+bl 


where 


n  ty  “"T"  n ,  •  2  n, -2-p 

1  "!«•?"*?»  piS  <  p  ><-'!>  W 


Tp<cl>  -Jto  J3'  4.  “1  -J^=^ 

-'i  •;*»; 


The  function  T  (c^  may  be  calculated  using  the  following  reduction 


formulae , 


P  =  2r  +  Is  Tp(c)  =  2r  r!  ®'(c)  £|Q  (f-J* 

(6‘5)  ,  i.  r  .  2  l-\ 

p  «  2r :  Tp(c)  -  2<r^(r-£)!  *'(c)  J£l  <f-> 

+  2  [  1  -  4>(c)]  . 

<£-i) : 

Thus,  making  use  of  an  electronic  computer,  and  consulting  those 
standard  normal  tables  which  would  permit  accurate  inverse  interpolation, 
we  would  be  able  to  calculate  the  density  p(h^J^).  Once  this  is  calculated, 
the  posterior  probability  integrals  can  then  be  approximated  using  stscccird 
numerical  methods,  for  example,  Simpson's  Rule,  etc. 

For  the  exponential  distribution  as  defined  by  (4.1),  we  have 
found  that,  when  y^  is  less  than  "a",  the  posterior  distribution  of 
and  is  given  by  expression  (4.5).  We  now  make  the  transformation 


Oj)  -  1  "  e3«P 


vi  3  V 


An  equivalent  form  of  this  reduction  formula  is  given  iu  I lihor 


(1961). 
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The  absolute  value  of  the  Jacobian  of  the  transformation  (6.5) 
v . 

is  |j|  =  y~-  .  Thus,  the  (marginal)  posterior  distribution  of  h^  is 
given  by 

n.-l  m 

n.[  (n.-l)w  ]  1  n£-l  r  -n  ,  n  'l 

(6.7)  - <l-V  J  vi  exp  {-  ^  dvi 

1  a(h1)  1 

where 


a(h i )  “  "  log  (1-1^)  * 

For  values  of  "a"  different  from  y^  and  upon  making  the  transformation 
x  «  n^^  '  *)  v^1,  the  integral  in  (6.7)  can  be  written: 


.  .  -(n.-l)  (t  n  -2 

(6.8)  tn^y^  ■  »)1  J  x  1 


“X  j 

e  dx 


n.(y.  -  a) 


d  a  — - 4 - 

i  a(h1) 


On  integrating  by  parts,  (6.8)  takes  the  form 

_  -(n.-l)  r  n  -2  d*  ) 

(6.9)  (n1-2)!  [njL(y1-a)]  jl  -  exp(-dt)  ^  J  • 

Using  (6.9)  we  then  have  the  alternative  form  for  the  posterior  distribution 


of  h4. 


r(n.-l)w.'Hn^-l)  n,-l  r  ni'7-  ‘■'I 

(6.10)  Mbh)  .  n  Mr— *  «-•>!>  {l-.xp(-V.Io  r :  / 

Ln1Cyi-a)-  *■  1 

From  (6.10)  it  is  easily  shown  that  the  posterior  probability 


(6.11)  Pr(ht  >  h^l^) 


l“i'‘a.h  >"1 .  -i^u’  r.„  \\2  i  ±° 

»-ni^yi"*) J  hl0^  (n^l^  ^  \  ioj  rSO  v=0  vi  [  ^ 

(n  -l)w^ 

where  u^o  «  ~  .  We  note  that  expressions  (6.10)  and  (6,11)  a-.e 


amenable  to  calculation. 
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As  an  example  of  the  resulting  posterior  distribution  for  the 
exponential  case  discussed  above,  three  samples  of  size  ten  each  from 
exponential  populations  with  (nt>  -  (0,  1),  (.1,  1.1),  and  (.2,  1.2) 
were  drawn.  The  sample  information  is  summarized  as  follows: 


Sample  No. 

1 

1 

1 

yil 

.0316 

.2286 

.5799 

Wi 

.9238 

.9744 

1.136 

yi 

.8630 

1.106 

1.602 

The  value  of  "a"  chosen  is  1.20.  The  resulting  posterior  distributions 
are  plotted  in  Figure  6.1. 

We  see  from  the  diagrams  that  the  posterior  distribution  for 
population  3  is  nearly  symmetric.  However,  the  posterior  distributions 
for  populations  1  and  2  are  skewed,  the  former  being  more  so  in  this 
respect  than  the  latter. 

In  fact,  on  examining  the  moments  of  h^  given  in  (5.4),  it  is 
evident  that  we  should  expect  some  skewness  of  the  above  type.  It  is 
for  this  reason  that  a  desire  for  a  "conservative"  decision  rule  might 
make  itself  felt.  For  example,  we  might  find  that  the  situation  depicted 
in  Figure  6.2  might  obtain. 

Here  we  have  found  that  the  expected  values  of  h^  (i  =  1,  2)  are 
nearly  the  same,  yet  as  we  are  interested  in  picking  the  best  population, 
that  is,  the  population  with  the  largest  value  of  h^,  we  are  inclined  to 
favor  population  1  since  the  a  posteriori  probability  of  h^  exceeding 
its  expected  value  is  obviously  larger  than  the  a  posteriori  probability 
of  h^  exceeding  this  value.  This  leads  us  to  propose  the  following 
procedure  which  may  be  favored  by  some  experimenters,  namely, 


■.I";  r-.‘ 


pO^Te^or  Dfei^Utfor'5  dy  Jti  ^vc  "R)0 
^fepM.WtTc»\5  hav'hvj  Sox^ti,  edihX 
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(i)  As  In  sections  3  and  4,  find  max  Let  this  value 

be  denoted  by  y  . 

(ii)  Determine  the  a  posteriori  probability  that  h^  exceeds  the 
quantity  yQ,  say  P^,  for  each  population.  Make  the  decision  that  the 
population  having  as  its  value  of  P,  the  value  max  P^,  is  the  best  population. 

Other  procedures  of  this  type  suggest  itself  at  this  point.  For 
example,  suppose  the  statistician  is  concerned  with  a  mass  production 
process  in  which  items  are  defined  to  be  non-defective  if,  re  a  certain 
specification,  they  measure  less  than  the  quantity  "a".  Then  obviously, 
among  k  processes  producing  this  item,  the  statistician  wishes  to  choose 
the  process  yielding  the  largest  proportion  of  items  which  measure  less 
than  "a”.  Further,  suppose  that  the  processes  are  costly  and  that  the 
break  even  point  is  accomplished  if  and  only  if  the  proportion  of  non¬ 
defectives  is  greater  than  a  known  percentage,  say  100  yX.  Then  one  would 
expect  the  statistician  to  choose  as  the  best  population  the  one  having 
largest  value  for  the  posteriori  probability  P^,  *  Pr  th^  >  7lij]  .  tn 
fact,  statisticians  may  be  faced  with  situations  which  would  call  for  rules 
of  this  type  but  using  different  choices  of  y.  The  value  of  y  chosen 
would  depend  entirely  on  the  particular  problem  involved. 

We  remark  that  knowledge  of  the  posterior  distribution  has 
enabled  us  to  arrive  at  the  decision  rules  of  the  types  described  in  this 
section.  It  seems  to  us  entirely  natural  to  base  our  decision  upon  the 
posterior  distribution,  in  preference  to  the  rules  exemplified  by 
sections  3  and  4,  which  are  based  on  maximizing  the  expected  value 

*We  are  indebted  to  Professor  David  Finney  (1962)  for  this 


remark. 
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E(h^i^).  For,  after  all,  the  posterior  distribution  gives  us  all  the 
Information  about  h^  including  its  expected  value.  Of  course,  because  of 
the  consistency  property  of  the  procedures  of  sections 3  and  4  (as  shown  in 
section  5),  we  recommend  that  the  procedures  of  this  section  be  used  only 
for  small  samples. 

VII.  Some  Other  Best  Population  Problems 

The  problems  discussed  in  sections  3  and  4  involved  "utility" 
functions  h^  «  g(e  )  which  are  rather  complicated  functions  of  the 
population  parameters.  The  analysis  used  in  these  two  sections  was  that 
dictated  by  the  general  framework  described  in  section  2.  Using  the  same 
approach,  we  turn  now  to  some  other  best  population  problems  where  the 
definitions  of  "best"  used,  involve  h^  which  are  simple  functions  of  either 
a  location  or  a  scale  parameter  of  the  population  distributions.  The 
specific  h^  and  their  expectations  are  summarized  in  Table  7.1. 

^ble  7.1 


Utility  Expected  Utility  When  Sampling  From 


h^gCSj) 

'xp  f 

yt  1-y 

^i  (1^i>  1 

<”l  /  »»1 

Vi  +  ‘i 

**1 

yi 

yil  ni(n1  -  2) 

ni  +  ii  +  ni 

JL 

/*  r  4> 

1 

°i 

Il.-l 

^"ni  r  ^2 — *  *i 

wi 

Prior 
Distri- 
but  ions 
Of 

Pa-|mr 

pC^)  «  k 

P(0  « 
i  ot 

pfoj)  «  k 

p(ffi)  -  J- 
i  ot 

r<Vmf) 

P<“1>’  rjTjTOp 
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These  types  of  boat  population  problem*  summarized  above  have 
been  conaldered  from  the  non- Baye elan  point  of  view  --  aee  Gupta  and 
Sobel  (1960,  1962),  Bechhofer  (1954),  and  Bechhofar  and  Sobel  (1954)  — 
and  from  the  Bayesian  point  of  view  by  Ralffa  and  Schlalfer  (1961) .  It 
la  easy  to  show  that  the  results  listed  In  Table  7.1  are  consistent. 

Also,  these  problems  can  of  course  be  treated  using  the  analysis  discussed 
in  section  6. 
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